Computational methods for the integration of biological activity and chemical space
نویسنده
چکیده
One general aim of medicinal chemistry is the understanding of structure-activity relationships of ligands that bind to biological targets. Advances in combinatorial chemistry and biological screening technologies allow the analysis of ligand-target relationships on a large-scale. However, in order to extract useful information from biological activity data, computational methods are needed that link activity of ligands to their chemical structure. In this thesis, it is investigated how fragment-type descriptors of molecular structure can be used in order to create a link between activity and chemical ligand space. First, an activity class-dependent hierarchical fragmentation scheme is introduced that generates fragmentation pathways that are aligned using established methodologies for multiple alignment of biological sequences. These alignments are then used to extract consensus fragment sequences that serve as a structural signature for individual biological activity classes. It is also investigated how defined, chemically intuitive molecular fragments can be organized based on their topological environment and cooccurrence in compounds active against closely related targets. Therefore, the Topological Fragment Index is introduced that quantifies the topological environment complexity of a fragment in a given molecule, and thus goes beyond fragment frequency analysis. Fragment dependencies have been established on the basis of common topological environments, which facilitates the identification of activity class-characteristic fragment dependency pathways that describe fragment relationships beyond structural resemblance. Because fragments are often dependent on each other in an activity classspecific manner, the importance of defined fragment combinations for similarity searching is further assessed. Therefore, Feature Co-occurrence Networks are introduced that allow the identification of feature cliques characteristic of individual activity classes. Three differently designed molecular fingerprints are compared for their ability to provide such cliques and a clique-based similarity searching strategy is established. For moleculeand activity class-centric fingerprint designs, feature combinations are shown to improve similarity search performance in comparison to standard methods. Moreover, it is demonstrated that individual features can form activity-class specific combinations. Extending the analysis of feature cliques characteristic of individual activity classes, the distribution of defined fragment combinations among several compound classes acting against closely related targets is assessed. Fragment Formal Concept Analysis is introduced for flexible mining of complex structureactivity relationships. It allows the interactive assembly of fragment queries that yield fragment combinations characteristic of defined activity and potency profiles. It is shown that pairs and triplets, rather than individual fragments distinguish between different activity profiles. A classifier is built based on these fragment signatures that distinguishes between ligands of closely related targets. Going beyond activity profiles, compound selectivity is also analyzed. Therefore, Molecular Formal Concept Analysis is introduced for the systematic mining of compound selectivity profiles on a whole-molecule basis. Using this approach, structurally diverse compounds are identified that share a selectivity profile with selected template compounds. Structure-selectivity relationships of obtained compound sets are further analyzed.
منابع مشابه
Biochemical and computational study of an alginate lyase produced by Pseudomonas aeruginosa strain S21
Objective(s): Alginates play a key role in mucoid Pseudomonas aeruginosa colonization, biofilm formation, and driving out of cationic antibiotics. P. aeruginosa alginate lyase (AlgL) is a periplasmic enzyme that is necessary for alginate synthesis and secretion. It also has a role in depolymerization of alginates. Using AlgLs in cystic fibrosis patients along with anti...
متن کاملA computational method to analyze the similarity of biological sequences under uncertainty
In this paper, we propose a new method to analyze the difference and similarity of biological sequences, based on the fuzzy sets theory. Considering the sequence order and some chemical and structural properties, we present a computational method to cluster the biological sequences. By some examples, we show that the new method is relatively easy and we are able to compare the sequences of arbi...
متن کاملSimulated Heat Integration Study of Reactive Distillation Column for Ethanol Synthesis
Ethanol is largely used as a solvent in the synthesis of varnishes and perfumes. It is also used as a preservative for biological products, fuel, and gasoline additives. Anethanol-water mixture, normally obtained by fractional distillation</span...
متن کاملInvestigation of the Molecules Obtained from Marijuana: Computational Study of Spectral, Structural and Docking
There are many chemical molecules whose names are Cannabigerol (CBG), cannabidol (CBD), cannabichromene (CBC), tetrahydrocannabivarin (THCV), cannabigerovarinic acid (CBGV) and cannabidiolic acid (CBDA) derived from Marijuana. Theoretical methods were used to compare the chemical and biological activities of the six major molecules. Molecules were compared with their chemical activities using m...
متن کاملPetra, osiris and molinspiration: A computational bioinformatic platform for experimental in vitro antibacterial activity of annulated uracil derivatives
Annulated pyrano[2,3-d]pyrimidine/pyrano[2,3-d]uracil derivatives were synthesized using aromatic aldehydes, active methylene compounds and barbituric acid in presence of dibutylamine (DBA) catalyst in ethanol as solvent. The different substituents on phenyl ring in the fused pyrano uracil skeleton showed productive influence on its antimicrobial activity against some gram positive and gram neg...
متن کاملThe Prediction of Thermo Physical, Vibrational Spectroscopy, Chemical Reactivity, Biological Properties of Morpholinium Borate, Phosphate, Chloride and Bromide Ionic Liquid: A DFT Study
In the light of computational chemistry, based on morpholinium cation-based Ionic Liquid, their different types of physical, chemical, and biological properties is highlighted. The physical properties are evaluated through the Density Functional Theory (DFT) of Molecular Mechanics and also examine the chemical and biological properties. The difference between Highest Occupied Molecular Orbital ...
متن کامل